34 research outputs found
Tuning and optimization for a variety of many-core architectures without changing a single line of implementation code using the Alpaka library
We present an analysis on optimizing performance of a single C++11 source
code using the Alpaka hardware abstraction library. For this we use the general
matrix multiplication (GEMM) algorithm in order to show that compilers can
optimize Alpaka code effectively when tuning key parameters of the algorithm.
We do not intend to rival existing, highly optimized DGEMM versions, but merely
choose this example to prove that Alpaka allows for platform-specific tuning
with a single source code. In addition we analyze the optimization potential
available with vendor-specific compilers when confronted with the heavily
templated abstractions of Alpaka. We specifically test the code for bleeding
edge architectures such as Nvidia's Tesla P100, Intel's Knights Landing (KNL)
and Haswell architecture as well as IBM's Power8 system. On some of these we
are able to reach almost 50\% of the peak floating point operation performance
using the aforementioned means. When adding compiler-specific #pragmas we are
able to reach 5 TFLOPS/s on a P100 and over 1 TFLOPS/s on a KNL system.Comment: Accepted paper for the P\^{}3MA workshop at the ISC 2017 in Frankfur
Quantitatively consistent computation of coherent and incoherent radiation in particle-in-cell codes - a general form factor formalism for macro-particles
Quantitative predictions from synthetic radiation diagnostics often have to
consider all accelerated particles. For particle-in-cell (PIC) codes, this not
only means including all macro-particles but also taking into account the
discrete electron distribution associated with them. This paper presents a
general form factor formalism that allows to determine the radiation from this
discrete electron distribution in order to compute the coherent and incoherent
radiation self-consistently. Furthermore, we discuss a memory-efficient
implementation that allows PIC simulations with billions of macro-particles.
The impact on the radiation spectra is demonstrated on a large scale LWFA
simulation.Comment: Proceedings of the EAAC 2017, This manuscript version is made
available under the CC-BY-NC-ND 4.0 licens
Optical control of the refractive index of a single atom
We experimentally demonstrate the elementary case of electromagnetically
induced transparency (EIT) with a single atom inside an optical cavity probed
by a weak field. We observe the modification of the dispersive and absorptive
properties of the atom by changing the frequency of a control light field.
Moreover, a strong cooling effect has been observed at two-photon resonance,
increasing the storage time of our atoms twenty-fold to about 16 seconds. Our
result points towards all-optical switching with single photons
Spectral Control via Multi-Species Effects in PW-Class Laser-Ion Acceleration
Laser-ion acceleration with ultra-short pulse, PW-class lasers is dominated
by non-thermal, intra-pulse plasma dynamics. The presence of multiple ion
species or multiple charge states in targets leads to characteristic
modulations and even mono-energetic features, depending on the choice of target
material. As spectral signatures of generated ion beams are frequently used to
characterize underlying acceleration mechanisms, thermal, multi-fluid
descriptions require a revision for predictive capabilities and control in
next-generation particle beam sources. We present an analytical model with
explicit inter-species interactions, supported by extensive ab initio
simulations. This enables us to derive important ensemble properties from the
spectral distribution resulting from those multi-species effects for arbitrary
mixtures. We further propose a potential experimental implementation with a
novel cryogenic target, delivering jets with variable mixtures of hydrogen and
deuterium. Free from contaminants and without strong influence of hardly
controllable processes such as ionization dynamics, this would allow a
systematic realization of our predictions for the multi-species effect.Comment: 4 pages plus appendix, 11 figures, paper submitted to a journal of
the American Physical Societ
Bayesian feedback control of a two-atom spin-state in an atom-cavity system
We experimentally demonstrate real-time feedback control of the joint
spin-state of two neutral Caesium atoms inside a high finesse optical cavity.
The quantum states are discriminated by their different cavity transmission
levels. A Bayesian update formalism is used to estimate state occupation
probabilities as well as transition rates. We stabilize the balanced two-atom
mixed state, which is deterministically inaccessible, via feedback control and
find very good agreement with Monte-Carlo simulations. On average, the feedback
loops achieves near optimal conditions by steering the system to the target
state marginally exceeding the time to retrieve information about its state.Comment: 4 pages, 4 figure
Analyzing quantum jumps of one and two atoms strongly coupled to an optical cavity
We induce quantum jumps between the hyperfine ground states of one and two
Cesium atoms, strongly coupled to the mode of a high-finesse optical resonator,
and analyze the resulting random telegraph signals. We identify experimental
parameters to deduce the atomic spin state nondestructively from the stream of
photons transmitted through the cavity, achieving a compromise between a good
signal-to-noise ratio and minimal measurement-induced perturbations. In order
to extract optimum information about the spin dynamics from the photon count
signal, a Bayesian update formalism is employed, which yields time-dependent
probabilities for the atoms to be in either hyperfine state. We discuss the
effect of super-Poissonian photon number distributions caused by atomic motion.Comment: 12 pages, 13 figure
cupla - C++ User interface for the Platform independent Library Alpaka
cupla [qχɑpˈlɑʔ] is a simple user interface for the platform independent parallel kernel acceleration library alpaka. It follows a similar concept as the NVIDIA® CUDA® API by providing a software layer to manage accelerator devices. alpaka is used as backend for cupla
cupla - C++ User interface for the Platform Independent Library alpaka
cupla [qχɑpˈlɑʔ] is a simple user interface for the platform independent parallel kernel acceleration library alpaka. It follows a similar concept as the NVIDIA® CUDA® API by providing a software layer to manage accelerator devices. alpaka is used as backend for cupla
EZ publication: source code, profiling, analysis and simulation data
Data of the PIConGPU simulations as used in the publication: EZ: An Efficient, Charge Conserving Current Deposition Algorithm for Electromagnetic Particle-In-Cell Simulations
Data overview:
picongpu_source.zip:
source code forked from the PIConGPU mainline version 0.7.0-dev
used input set `share/picongpu/examples/PaperThermal`
runs_charge_conservation.zip:
output including hdf5 dumps to validate charge conservation property for the PaperThermal setup (warm plasma)
runs_performance.zip:
simulation timings output for Spock CPU, Spock GPU and Summit GPU runs
runs_profiling.zip:
profile data for Spock GPU and Summit GPU runs
runs_singleParticleTest.zip:
output including hdf5 dumps to validate charge conservation property for the single particle test
analysis_scripts.zip:
jupyter notebooks for setup and analysis of PaperThermal setup
python script to plot charge conservation from hdf5 simulation output over time
bash script for statistical analysis of performance run
PIConGPU setup: PWFA simulations
This is the PIConGPU source code and setup files for generating PWFA simulations. This setup was used to study wake elongation.This is a simulation setup accompanying a experimental study